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Abstract. We generalize a well-known result of L. Caffarelli on Lipschitz 
estimates for optimal transportation T between uniformly log-concave prob- 
ability measures. Let T : R d — ► M. d be an optimal transportation pushing 
forward = e~ v dx to v = e~ w dx. Assume that 1) the second differential 
quotient of V can be estimated from above by a power function, 2) modulus 
of convexity of W can be estimated from below by j4 9 |x| 1+ij , q > 1. Under 
these assumptions we show that T is globally Holder with a dimension-free 
coefficient. In addition, we study optimal transportation T between /i and the 
uniform measure on a bounded convex set K C K 1 *. We get estimates for the 
Lipschitz constant of T in terms of d, diam(K) and DV, D 2 V. 
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1. Introduction 

According to a well-known result of L. Caffarelli [9] any optimal transportation 
mapping T pushing forward the standard Gaussian measure 7 to a probability 
measure e~ w ■ 7 with convex W is 1-Lipschitz. This remarkable observation al- 
lows to recover many interesting results on analytic properties of uniformly convex 
measures. For instance, the Bakry-Emery condition for the log-Sobolev inequality 
("flat" part) Q], Bakry-Ledoux comparison theorem [2], some correlation inequali- 
ties [9], [13]. Some recent generalizations of [9] can be found in |24) . 

The estimates of this type go back to A.V. Pogorelov [19] (see also "Pogorelov 
lemma" in fT2] ). According to his result, any the smooth solution of the Monge- 
Ampere equation det D 2 ip = 1 in a smooth domain 51 C M. d satisfies 

Cit^fi') < D 2 tp(x) < C 2 (fi,fi')> 
x G fi', where f2' C fl is another domain. The Holder estimates play fundamental 
role in the regularity theory of partial differential equations. The overview of results 
on a priori estimates for fully nonlinear differential equations see in [15] , |10) . For a 
more special case of the Monge-Ampere equation the readers are advised to consult 
[H], EE El]. 

On the other hand, mass transportation method is widely used in probability 
for establishing diverse isoperimetric and concentration properties. Given a "good" 
measure fi with known isoperimetric properties (e.g. Gaussian, product measures) 
and another measure v one can try to transform \x into v and deduce the desired 
information about v from the properties of the mass transport (Lipschitz or Holder 
estimates). Let us indicate some typical situations 
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1) fi is Gaussian and v is another product measure (see [21] . |16j ) 

2) v is the uniform measure on a convex set and fi is log-concave with the 
same modulus of convexity (see, for instance, [5], [16] , |17j . and references 
therein) 

3) fi is Gaussian and D 2 W > C (situation of the Caffarelli theorem). 

The paper is organized as follows. In Section 2 we consider the optimal trans- 
portation T pushing forward e~ v dx to e~ w dx. Assume that V and W satisfy 

V(x + y) + V(x - y) - 2V(x) < A p \y\ p+ \ 
W{x + y) + W(x -y)- 2W(x) > A q \y\ q+1 
with < p < 1 < q and some positive A p ,A q . We show that T is globally £±j- 
Holder: 

\T(x)-T(y)\ <C\x-y\^, 
with C depending only on p, q, A p ,A q . 

In particular, this implies the following. Let v = e~ w dx be a probability measure 
satisfying 

W(x + y) + W(x - y) - 2W{x) > C p \y\ 213 

with f3 > 1. Then there exists C depending only on /3, Cp such that v satisfies the 
following concentration inequality: 

for every B C R d with v(B) > \, B r = {y : \x - y\ < r, x e B}. This is 
a previously known consequence of a modified log-Sobolev inequality proved by 
Bobkov and Ledoux [5] . The Bobkov-Ledoux result has been generalized in pj] by 
transportation arguments. The corresponding isoperimetric inequalities have been 
proved in [17] by localization techniques. 

In Sections 3-4 we prove estimates of the following type: 

\\dM < /(vv»), 

where / is a non-negative function and Vt^ is the optimal mass transport pushing 
forward a smooth measure fi to v = jrg) ' Mk, where K C is convex and A 
is the Lebesgue measure. In particular, we show that any uniform measure v on 
a bounded convex set if is a Lipschitz image of the standard Gaussian measure 
which Lipschitz constant does not exceed C\fd ■ diam(if) with some universal C. 

After publishing online a preliminary version of this paper, the author obtained a 
remark from the participants of the Convex Analysis seminar in Tel- Aviv University. 
They noticed that the proof of Proposition 12.71 can be significantly simplified for 
the case of convex functions. The arguments are presented in Appendix. Consider 
a log-concave measure v — e~ w dx such that 

W(x + y) + W{x -y)- W{x) > S(\y\) 

with a non-negative increasing function 8. Then the optimal transportation of the 
standard Gaussian measure to v satisfies the following: 

\V<p(x)-V<p(y)\ <8^ 1 (4| s -y| 2 ). 

This implies, in particular, that v admits the following dimension-free concentration 
property: 

y(B r ) >l-iexp(-i*(r/8)) 
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for every B C R d with v{B) > 1/2, B r = {y : \x - y\ < r, x £ B}. 

The author expresses his gratitude to the colleagues from the Tel- Aviv university, 
especially to Ronen Eldan and Sasha Sodin for communicating the proof of Lemma 
15.11 and to Emanuel Milman for pointing out a mistake in the earlier version. 

2. Global Holder estimates 

We deal throughout the paper with the standard finite-dimensional Euclidean 
space. 

We recall that for every couple of probability measures p\ = p\dx, p% = pidx 
there exists a mapping T : W d — > R d pushing forward p\ to p 2 which has the form 
T = V<^, where <p is a convex function (see [25] for details). For smooth T one has 
the following change of variables formula (Monge- Ampere equation) : 

p 2 (V</?) det D 2 ip = pi. 

All the measures considered below are supposed to have a convex support of 
positive Lebegue measure. 

Recall that a probability measure p is called log-concave if it has the form p = 
e~ v dx with a convex V. Finally, recall that a mapping T : W 1 — > M. d is M-Lipschitz 

\x - y\ 

By the Rademacher theorem T is almost everywhere diffcrcntiable with ||-DT|| < M. 
A mapping T is a-H61der if there exists < a < 1 such that 

\x-y\°> ~ 

We start with the one-dimensional case. Let T = ip' be the optimal transporta- 
tion, pushing forward p = e~ v dx to v = e~ w dx with supp(^) = [a, b], where 
V : K -> R and W : (a, b) -> K satisfy 

(1) \V'(x)-V'(y)\<C p \x-y\*, 

(2) (x - y)(W'(x) - W'(y)) > C q \x - 

for some < p < 1, 1 < q. 

Measures p and v are supposed to be probability measures. We are looking for 
a maximum point (£0,^0) of the following function 

F(t,x) = ^ a ^ y ; signft), < a < 1. 

Remark 2.1. The arguments below are non-rigorous. A more general version of the 
result (with a rigorous proof) see in Corollary 12.91 

Setting t' = —to and x' = xq — to if necessary, we may assume that t > 0. 
Differentiating in x at the maximum point yields 

ip"(x Q + to) = <p"{X(i), 

<p'"(xo + t ) < <p'"(x ). 

Hence 

(3) <P'"(XQ + t ) _ tp"'(x ) < 

1 1 <p"{x +t ) ip"(x ) - 
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Taking derivative in t we obtain 

ip"{x + t ) = — (V(x + t ) - <p'(x ) 
t v 

Let us differentiate the change of variables formula. One gets 

Using we obtain 
V'(x + t ) - V'{x ) > (W'(<p(x + t )) - W'(<p'(x )))<p"(x + t) 

= - \W'(<p(x + t )) - W'(tp'(x ))\ [ip'(x + t ) - <p'{x ) 
Then it follows from the assumptions on V and W that 

C p tg +1 > aC q \ip'(x + t ) - <p'(x )\ q+1 . 

Hence 



'<2 

Thus we get the following statement: 

Theorem 2.2. Assume that V : M — >• M and W : R — > IR are continuously differ- 
entiable functions satisfying (QP and (0) /or some < p < 1, q > 1. Set: 

p+1 
< a := f—— < 1. 
0+ 1 " 



Then 



Remark 2.3. A closed result has been established in [3|: any optimal mapping 
sending the one-dimensional exponential measure v = ^e~^dx to e~ v ■ v with 
|V| < 1 - c, c> is i-Lipschitz. 

We assume that V and satisfy 

(4) V(x + y) + V(x -y)- 2V(x) < A p \y\P +1 , 

(5) W(x + y) + W(x -y)- 2W{x) > A q \y\ g+1 , 
for some < p < 1, 1 < q, A p > Q, A q > 0. 

Remark 2.4. It can be easily verified that assumptions (@|, © only make sense if 
< p < 1, 1 < q. Indeed, assuming other values of p, q, one can easily deduce that 
the second derivatives of W, V are either zero or infinity everywhere. 

Theorem 2.5. Let /i and v satisfy f5|) for some < p < 1, 1 < q. Then tp 
satisfies 



(6) tp(x + th) + tp(x -th)-2p{x)<2(-f) q+1 t 1+a 

V Aq I 

for every unit vector h € W 1 with a = 2±- . 
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Proof. To prove the multi-dimensional case we follow the arguments of Caffarelli 
from [5]. We consider the differential quotient 

52<p(x) = tp(x + th) + ip(x — th) — 2<p(x) > 

for some vector del' 1 with \h\ = 1. Without loss of generality we may assume 
that the support of v is a bounded convex domain. Note that according to a result 
from [7] cp is twice continuously differentiable. It follows from the arguments of 
[9] that linx^oo 8-2<f{x) = 0. Thus there exists a maximum point xq of 6%ip{x). 
Differentiating at x Q yields 

(7) Vp(x Q + th)+V(p(x Q -th) = 2Vip(x ), 

D 2 f{x + th) + D 2 f{x - th) < 2D 2 f{x ). 
It follows from the concavity of determinant that 

'D 2 p(x + th) + D 2 p(x - th)^ 



det D 2 f(x Q ) > det(- 



2 

> (det D 2 p(x Q + th) dct D 2 (p(x - thfj \ 
Applying the change of variables formula det D 2 ip = e w (Vv)-V one g na jiy g e t s 
(8) 

V(x Q + th) + V{x - th) - 2V(x Q ) > 

W(Vp(x Q + th)) + W(Vp{x - th)) - 2W(ytp{x )). 

It follows from (0 that v := Wf(xo + th) — Vf(xo) = ~Vip(xo) — Vf(xo — th). 
Hence we get by (JSJ) that 

A p t p+1 > A q \Vip{x +th)-V<p{x Q )\ q+l = A q \Vip(x -th)-V<p(x Q )\i +1 = A q \v\ q+1 . 
By convexity of f 

f(xo + th) + f(xo — th) — 2cp(xo) < t(Vy(xo + th) — Vip(xo — th), h) 

= 2t(v,h) < 2t\v\. 

Finally 

□ 

Remark 2.6. For the case p = q = 1 the estimate is known to be slightly better: 

(A* 
V 

Our next goal is to establish Holder continuity of Vip. 

Proposition 2.7. Let u : R d -> R be a differentiable function satisfying 

\u(x + y) + u(x -y)- 2u(x)\ < C\y\ a+ \ 

a > and u, |Vu| be integrable with respect to every Gaussian measure. Then there 
exists a constant C depending only on a, C and d such that 

\u v (x + y)-u v (x)\<C'\y\ a 

for every unit vector v £ R d and every x,y £ R d . 
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A ~ 

f(x + th) + ip{x - th) - 2f(x) < 2 t 2 . 

V An / 



Proof. It is known that every bounded / satisfying 

\f(x + y) + f(x-y)-2f(x)\<C\y\ a+1 

admits a Holder continuous derivative: |V/(a; + y) — V/(a;)| < A\y\ a (see [20] . 
Chapter 5(4), Proposition 9). Below we give a modification of the proof from [2D] . 
Let us consider the heat semigroup acting on u. 

Ptu = J u(x-y)pt(y)dy, p t {y) = ^ 2jTt y /2 e 

We fix an ortho normal basis {e;}. Clearly, it is sufficient to prove the statement for 

\u Xi (x + htj) - u Xi (x)\ 

with any 1 < i,j < d and some fixed h > 0. Note that \u Xi (x + hej) — it Xi (x)| does 
not exceed 

\PtU Xi (x + hej) - P t u Xi (x)\ + \u Xi (x + he 3 ) - P t u Xi (x + he 3 ) \ + \u Xi (x) - P t u Xi (x)\, 

for every t > 0. 

The first term is estimated by 

fh 



P f u Xi (x + hej) - P t u Xi (x) = / u XiXj (x + rej - y)p t (y) dy dr 



JR d 



1 

2 Jo 



1 

< - 
~ 2 



Obviously 



u(x + rej - y)pt{y) yi y 3 dy dr 
(u(x + rej - y) + u(x + rej + y) - u(x + rej))p t {y) yiy] dy dr 
\u(x + rej -y) + u(x + rej + y) - u(x + re 3 )\\p t (y) yiyj | dy dr. 

Pt{y) yiyj = \ - $ij)pt(y)- 



Hence 



\Ptu Xi {x + hej) - P t u Xi (x)\ < 



\l+a 



ViVj 



Pt(y) dydr. 



It is clear by scaling arguments that the latter does not exceed 

Ci{C,a,d)h ■ V^r . 

Thus we get for t = h 2 : 

\P t u Xz {x + he 3 ) - P t u Xz (x)\ < d(C,a,d)h a . 
Let us estimate the remaining terms. To this end we consider (P t u) t . One gets 

dy 



(P t u) (x) = u(x- y) 



.{2nt) d / 2 

1 f ^ d 

2 Ld ~ V ^ + U ^ X + y ^~ 2u ( x i) (|^2 ~ 2t) Pt ^ dy - 



Hence 
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(9) \{Ptu) t (x)\ <tj | tf |«+i ^ - I C,[C. n.,1) ■ I~r 



Integration yields 

(10) \P t u(x)-u(x)\<C 3 (C,a,d)-t B ^. 
Note that 

u Xi (x) - P t u Xz (x) = - I {P s u) sx ds, 
Jo 

P S U = P s/2 U * p s/2 - 

Differentiating the convolution identity yields 

Ptu Xi {x) - u Xi {x) = I ( (P s / 2 u(y)) s (p s/2 (x - y)) x% dy)ds 
Jo vk<* ' 

+ (/ Ps/2u{y){p s /2{x-y)) XlS dy)ds. 

Jo VR* ' 

It is easy to check that 

t , r v 

P s /2u(y)(p s / 2 (x - y)) XlS dyjds 

( / (P s /2U-u)(y)(p s / 2 (x-y)) XiS dy)ds + P t / 2 u Xi (x)-u Xi (x). 

One has for t = h 2 

PtU Xi (x) - P t/2 u Xi {x) = I (/ (P s / 2 u) (y)(p s / 2 (x - y)) Xl dy)ds 
Jo K Jm d ' 

+ / ( / i P s/2U - u)(y)(p s/2 (x - y)) XtS dy)ds. 
Jo K Jm d ' 

Then it follows from (JTSJ), © that 



|Pftia: 4 (a;) - P t / 2 u Xi (a;)| < 

Ps/2(x - y)) Xi \dyds + / |p g / 2 (a; - y)) SiS |dyrfsJ. 

./n ./rod ' 



ft _ r rt 

CA \ 



Applying the same arguments as above we obtain 

r-l 



\PtU Xi (x) - P t / 2 u Xi (x)\ < C 5 / s 2 ds = C 6 ti. 

Jo 

Hence 

\P t u Xi (x) ~ u Xi (x)\ 

OO OO _q 

< E \ p t/2»u xi {x) - iV+^(*)| < ^6^(^) 2 = C 7 W 2 = C 7 /i Q . 

k=0 k=0 

The same estimate holds for the remaining third term. The proof is complete. □ 

Remark 2.8. The statement does not hold for a = (see [20] 4.3.1). 
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Corollary 2.9. Under assumptions of Theorem \2.5\ the optimal transportation V^s 
is Holder continuous: 

\W^{x)-V V {y)\<C\x-y\ a 
with C depending only on p,q,A p ,A q . 

Proof. Replace v by ^^jHa with a bounded convex A. Clearly, the new potential 
satisfies assumptions of the theorem. Hence, using standard approximation argu- 
ments we can restrict ourselves to the case of compactly supported v. Thus we 
assume without loss of generality that |Vy| < K. By Theorem 12.51 

(11) ip(x + th) + <p(x - th) - 2tp(x) < C\t\ a 

for every unit vector h. We get immediately from Proposition ^ . 7l that V<p is Holder. 
To see that the constant does not depend on dimension, let as fix x, h £ K d and 
v £ R d with \h\ = \v\ = 1. Consider the restriction of <p onto the affine hyperplane 
A x ,v,h = {x + th + sv, s £ M, t £ K.}. This restriction clearly satisfies (TTT1) . Thus 
the 2-dimensional version of Lemma 12.71 implies that 

\{V<p(x + th)-V<p(x),v)\ = \(p v (x + th)-(p v (x)\ < C'\t\ a , 

where C (a corresponding "two-dimensional" constant) does not depend on d and 
on directions of h,v. Since this can be repeated for every x,h,v with the same 
constant C", one has 

\Vcp(x + th) - Vip(x)\ = sup \(V<p(x + th) - Vip(x),v)\ < C'\t\ a . 

M=i 

The proof is complete. □ 
Remark 2.10. A more general statement see in Corolarv 15.21 



3. Estimates for the second order derivatives: one-dimensional case 



We start with some heuristic estimates in the one-dimensional case. Consider a 
convex function <p such that <p sends fx = -j^ e 2 da; to a probability measure 



-w 



dx. We assume that W is sufficiently smooth. 



By the change of variables formula 



(12) 



log 2tt 



y = -T*V)+log¥>" 



Assume that f is a smooth function such that -Jf— 
point xq. One has at this point 

((p") 2 f(ip r ) tpW 3 <pW<p"f'(<p') 



admits its maximum at some 



<13) iwv 

Thus we get 



= 0, 



"\3 



(14) 

Applying (|14p we find 
(15) 
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(3) WW) 



L 

r- 



< o. 



^ (4) <(/) 3 [^ + (f)V). 



Differentiating (|T2"j) twice yields 

= 1- w'VXv ) 2 - w^> (3) + - (^r) 

Applying (fTlf and (|T5|) we get 



Now assume that / satisfies — A- + VF" + > jt- Thus we get at 
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< 1. 



Since xq is supposed to be the point of maximum for > we get a formal proof 



of the following statement: 

Assume that f and W satisfy 



f" 

J — + W" + W' J -r > 



f 



f ~ P 



The 



Of course, these arguments are non-rigorous. Nevertheless, they are applicable 
for many reasonable situations. 

Example 3.1. For the case W" > K and / = -7= we get a refinement of the 
Caffarelli's result: 

»<r 1 



Example 3.2. Assume that 



2a 1 



a,a] ■ 



Applying the arguments from above, we get the following estimate: 

9" < /V), 

where / : [—a, a] — > K + is a concave even function, decreasing of [0, a], such that 
f(a) = /(-a) = and 



-// =1. 



This function can be found explicitly: 



Here $ : [0, 1] ->■ 



,A 

' V 2 



/(t) = /(0)-$- 1 ( 

is defined by 
ds 



i/vT/(°)-* 



/(o) 



/o V— 21ns JV-21no 
In addition, a and /(0) are related by 



e 2 <ig. 



/(0) = a\/-- 
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Thus 



Note that this example can be obtained directly from the Caffarelli theorem. 
Indeed, set: 

r x ds 

G{x) = / = -f(x). 

Jo 

It can be verified by the direct computation that G pushes forward -^I[-a,a] to 
v = ^7rjg cts ■ 'yi [—6,6] 5 where b — G(a). Note that T = G(tp') is the optimal 
transportation of 7 to v. Hence, by the Caffarelli theorem, T is f-Lipschitz: T' < 1. 
Thus jfa< 1. 

4. Estimates for the second order derivatives: multidimensional case 

For the rest of the paper v is the uniform measure on a bounded convex domain 
K . The potential V is supposed to be at least two times differentiable with a Holder 
second derivative. 

Note that the assumption of convexity of K is crucial for smoothness of p. It 
is well known that for non-convex sets the potential tp is not smooth in general. 
Nevertheless, in our case the Caffarelli's regularity theory ensures that tp is smooth 
(see [7], [S]). Indeed, let us take any convex domain K' C K such that dK' lies 
positive distance from K. Let Kq = T^ 1 K' . It is easy to check that Kq is bounded 
(otherwise we get the contradiction with the monotonicity of T). Then applying a 
result from we immediately get that T is differentiable with a Holder continuous 
derivative inside of Kq. The further regularity follows from the smoothness of V 
by the classical arguments (see [7], Remark 4.15 in [25]). 

Definition 4.1. For a positive a and p £ K. let f(t) := f p , a (t) : [—a, a] — > M + be a 
function satisfying: 

1) //"+p(/') 2 = -l 

2) / is even and /(a) = 0. 

Assumptions l)-2) define uniquely a function /, which decreases on [0,a]. As- 
sume for a while that p > 0. First, taking / as a new variable and integrating I), 
one gets 

(if)"-!. 




This yields the following expression for /: 



/W = /(o)^-(v^), 

where W : [0, 1] — > [0, J Q 2 sin? a; efcc], 

In addition, a and /(0) are related by 

rp dr 



dr 



t VT^r 1 

*VP = /(0) / - /(0) / sini x dx. 

Jo vl-r Jo 
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The case p = has been considered in Example 13.21 The case p < is similar 
to p > 0. One has 



where $ : [1, oo] — > [0, J 2 sin 1 » x dx], 

*(*) = 

In addition, a and /(0) are related by 



aV=P = /(0) / nr — = /(0) / sin" 1 -* x dx. 
Ji \Jr l - 1 Jo 

Theorem 4.2. Le£ if C R d , d > 1 be a bounded convex set of a positive volume. 
Let Vip : M. d — > if 6e i/ie optimal transportation pushing forward probability measure 
/i = e~ v dx irai/i smooth V to the uniform measure x(k) ^ k ' Assume that Vhh < A 
for some h £ M. d with \h\ = 1. TTien one /ias 

Vhh < VA ■ fd^i a {(fi h - 1 ), 

where to is chosen in such a way that 

Li — {x : (x, h) = to — a}, L2 = {x : (x, h) = to + a} 

are supporting hyperplanes to K . 

If, in addition, we assume that \Vh\ < M then the following dimension-free 
estimate holds: 




Vhh < yA + 4 ^ ■ f P ,a(<Ph - h) 

for any p > — 1 . 

Proof. Note that the estimates are invariant with respect to any shift of the space. 
Let us shift K is such a way that K contains the origin. This clearly implies that 
lim^^oo ip( x ) — +00. We are looking for a maximum of Lphh{x)e^^ lfh ^ x ^ among all 
of x G M. d , with ip to be chosen later. To apply the maximum principle and make 
sure that the maximum is attained we deal with the following compactly supported 
modification: 

F e (x) = {l-e V ) + v hh (x)e^(*)). 

Consider the change of variables formula for ip 

C{K) + log detD 2 if = -V. 

Let y be a desired maximum point of F e . All the computations below are made at 
this point. First we change the coordinate system linearly in such a way that 

l)h = ei, 2) D 2 ip(y) is diagonal. 

The first requirement is achieved just by a rotation. In addition, without loss of 
generality we may assume that (D 2 (p)i.j, i,j > 2 is diagonal. To fulfill the second 
one we choose a non-orthogonal linear transformation. 

x 1 = xi H r^ x 2 H 1 TT x <i' 
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x\ = Xi, i G {2, • ■ ■ ,d}. 

Here (x[, ■ ■ ■ , x' d ) is the "new" coordinate system and (x\, • • • , Xd) is the "old" one. 
One checks easily that ip x > x >. = for i > 2. Since the transformation is represented 
by a unimodal matrix, the change of variables formula looks in new coordinates as 
follows: 

(16) C(K) +logdet Die = ~vU - ^f\x' 2 ^iM^, x> 2 , ■ ■ ■ , x' d ) , 

where D^ip denotes the matrix of second derivatives computed in new coordinate 
system. In what follows we set: x\ — ^^lfa) ,^ _ . . . _ fa) x > _ anc j wr j-t; e 

for simplicity Xj instead of x\ . 

Let us differentiate ate (jTHJ) . To this end it is convenient to use the following 
expression for the determinant 

det£>V= sign^Vzi^d, • • • Vx d x a{d) 

and taking into account that (p XiXj (y) = if i ^ j, we get at y: 

«-) - fi ^ a -( s ^)'- 

2 

^""^ ^KiaiiXia;! ^ fxiXiXi fxjXjX\ r : , j 



Differentiating (|16p twice in xi yields 

a ^ t/ en ^ f, j. n 2 1 (det D 2 ip) xix / (det D 2 ip) Xl \ 2 

-A < -V XlXl (Ox) = [log det I? ^ = detg2y ( detDV ) • 

It follows from {32}, JTHJl that 



Viii.xiii V~~^ ^PxiXjXi 



(19) _ A <^^ i x i x 1 x i _^ 

I I, J 3 3 

Since log-F e admits its maximum at y, one has at this point 

(20) - r ^^x 1 + ^^ + V%x 1 K 1 * 1 =o, 

(21) - i^i, H =0, i > 1. 

1 - e<p <Pxixi 

Differentiating log F £ twice in x\ yields 

2 

^ 2 , :rxixiXiXi / ^a;ia;i:i 

_ i fex 4 - ZzPxi H I 

(l-e<p) fen V fen 

+ t/j (fe)fe X( Xj < 0. 
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Multiplying this inequality on liCl , summing in i and applying (|19p one gets 
(22) —<Pw 1 x 1 -- / rsVxixA + ••• + M 

2 2 



TXiXjXi 



^ ^XiXi^XjXj ^ ^XiXi^PxiXi 



+ ip" {vxi)vl lXl + i> {fx 1 ) \Y] lfxiX * x * <p XlXl < A 



Note that 



2 2 2 

i,j VxiXi^XjXj . fxtxt^XiXi ie { 2 ,--- ,d} jE{l,--- ,d} ^ XiXi iXi 

2 2 

^ fxjXjXx ^ y-^ PxjXixi 

je{2,-,d} <e{2,-,d} 97x1X1 



One gets by (gj 



^-v yajajZi ^— v fx lXl Xi ^ ^a;.; 3; j xi ^-v £ <PxxX i ^2 

V>°**&*i*i i ^Vx iXi - . ,f-. d} . £{ ^ d} (1 - Sip) 2 Vxi a 



9, 



Hence it follows from (|22|) 

e 2 2 v-^ ¥>L 



+ i) (<p Xl )<p XlXlXl < A. 



iG{2, -rf} 

One obtains from (p?U| 



¥>X1X!X1 = 1 Vxi^XiX! -IP&xJtPx 

I — sip 



By the Cauchy inequality 



E^XjXjX! . [ \ ^ <)2xix;x; 1 . (,« i j , , ' , u 2 2 

ie{2,-,d} ie{2,-d} 4 

Hence 

Xi VxiXi 



1-^™ (i- e¥ >r 1 

+ [/(^ 1 )-^(V'(^ 1 )) 2 ]<* l <A. 
Applying the Cauchy inequality one gets 

Let ■i/'e be a smooth function on [infygx (x±, y), supj /g ^(xi, y)] satisfying 
(23) ^'>(l±^ +£ )(^) 2 + d V + e 2 ^. 
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The maximum principle implies 



e^-iV*!*! (!-£¥>)+ <max(e^^Vx 1 x 1 (l-e^)+) < 4 /A + e + 2e 2 sup(y,xi) 2 . 

Set: / E := e - ^ 1 . The differential inequality for ip e can be rewritten in the following 
way: 

(24) - fj: >(^- 1 + e) (f £ f + 1 + e^fl 



The following easy-to-check observation completes the proof: there exists a se- 
quence of nonnegative functions {f e } on [—a, a] satisfying (pMf such that f e —> 
f d-i a uniformly on [—a, a]. In the limit e — > we get from (|23|) 



xix 1 < VAfd-i (<p Xl ). 



The proof of the first statement is complete. 

To prove the second statement we use (|17p and (|22p to get 



2 

. ^PxiXirXiXi 

Since |T4J < M, arguing as above and passing to the limit e — ► 0, one gets the 
desired inequality for -0: 

ip" (fxjtpl^ ~ M\ip' ((p Xl )\<p XlX1 < A. 
Setting / = — log "0 and applying Cauchy inequality we obtain that for / satisfying 

-ff" + (l- 6 -M*)(ff>l 



one has tp XlXl < f(<p Xl )y A + jg. Let us set: — p = 1 — |M 2 . The proof is 
complete. □ 

Corollary 4.3. 1) Let /i = 7 &e £/ie standard Gaussian measure. One has 

diam(K) 



\D 2 ip\\ < Vd^l 



r • 

4 J 2 sin 11 - 1 a; ax 



2) Let = O i=1 Hi, where every pn is a copy of a measure Ho on K satisfying 
the following: a) fio = e~ v dx with smooth V, b) \V'\ < 1, V" < 1. For 
instance, one can choose V(x) to be equal \x\ for large values of \x\ and 
quadratic for small ones. Then 



\dM < J-p(i < d ^ diam(K) 



4(1 2jJ 2 sin -1- * x dx 



for any — 1 < p < 0. 
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5. Appendix : applications to functional inequalities 

The arguments presented in the following lemma have been communicated to 
the author by Sasha Sodin. It turns out that convexity condition allows to avoid 
the use of semigroup techniques in Proposition 12.71 



Lemma 5.1. For every convex f and unit vector h one has 

\Vf{x + th)- f{x)\<- sup (f(x + 2tv) + f(x-2tv)-2f(x)). 

1 v.\v\ = l y ' 

Proof. Let us start with d = 1. Note that 

,o 

f(x + 2t) + f(x - 2t) - 2f(x ) = / (f(x + s + 2t) - f(x + s))ds. 

Since /' is increasing, one has 

f'(x + s + 2t) - f'(x a + s) > f(x + 1) - f(x ) 
for every — t < s < 0. Thus 

,o 

f(x + 2t) + f(x - 2t) - 2f(x ) > J (f(x + s + 2t)- f(x + s))ds 

> J {f'(x +t)-f'(xo))ds = t{f'(x +t)-f'(x )). 

This completes the proof of the one-dimensional case. Let d > 2. For every couple 
of unit vectors u, h and t > one has by the cyclical monotonicity (see [2"5] ) 

(V/(a; + tu),th) + (Vf{x),tu) + (Vf{x + th), 0) 

< (V/(x + th), th) + (V/O), 0) + (V/(.t + tu), tu). 

Hence 

(V/O + tu) - Vf(x),th) < (Vf(x + th) - Vf{x), th) + (V/(x + tu) - V/O), tu). 
Then it follows from the one-dimensional case that 
(Vf(x + tu)-Vf(x),th) 

< - (f(x + 2th) + f(x - 2th) - 2/0)) + i (f(x + 2tu) + f(x - 2tu) - 2/0)) ■ 

It remains to take supremum over all unit vectors h, u. □ 

This estimate implies generalized continuity of the optimal transport for other 
types of uniform convexity. 

Corollary 5.2. Assume that W satisfies 

W(x + y) + W(x -y)- W(x) > 6(\y\) 

with some non-negative increasing function 6. Then 

|V^O)-V^(j/)| <8^ 1 (4| S -y| 2 ). 

Proof. Following the proof of Theorem l2.51 one easily finds that 

ip(x + th) + tp(x - th) - 2(f(x) < 2t6~ 1 {t 2 ). 

The result follows from Lemma T5. II □ 
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Corollary 5.3. Under assumptions of Corollary ] 5. S\ v — e w dx satisfies 

where $(i) = ^= JL^ e~~ ds. In particular, v admits a dimension-free concen- 
tration property 

v{A r )>l-\e^ 5(r/8)) 

with v(A) > 1/2. 

Proof. Consider the optimal mapping T pushing forward the standard Gauss- 

1 |x| 2 

ian measure 7 = e 2 dx to v. The statement follows immediately from 

Corollary 15.21 the Gaussian isoperimetric inequality (see [6]) and the estimate 
$(t) > 1 - \e~^. □ 

Finally, let us note that similar concentration results can be also derived from 
the so-called above-tangent lemma. The detailed description, references and various 
applications can be found in [25], we just mention briefly some important results. 
The above-tangent estimate have been used by Talagrand [22] for establishing trans- 
portation inequality for the standard Gaussian measure. It was understood later 
that similar arguments can be used for proving a broad class inequalities of Sobolev 
type. In particular, the Bobkov-Ledoux result has been generalized in [TT] (see also 
|14j). The corresponding isoperimetric inequalities have been proved in [T7] by lo- 
calization arguments. Finally, transportation approach to functional inequalities 
for non log-concave measures has been developed in [3] . 

We note that there exists another measure of convexity, which is especially con- 
venient when one deals with the above-tangent arguments. 

Remark 5.4. Everywhere below we deal with an arbitrary (non-Euclidean) norm 
|| • || on R d . 

For a convex W and || • || let us define S : M+ ->• M+ and b : R+ ->• M+ in the 
following way: 

5{t) = m£{w(x + y) + W{x - y) - 2W(x) : \\y\\ > tj 

(25) b(t) = M{w(x + y)- W{x) - (VW(x),y) : \\y\\ > t} 

These quantities are equivalent in a sense. 
Lemma 5.5. One has 

b(2t) - 2b(t) > S(t) > 2b(t). 
Proof. Relation 5(t) > 2b(t) follows from 
W{x + y) + W{x -y)- 2W{x) = 

(W(x + y)- W(x) - (VW(x), y)) + (w(x - y) - W(x) - (VW(x), -y)^ . 
Further, note that 

W(x + |) - W(x) - (Vff(4 I) > b(t/2). 
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Taking y with \y\ = t one has 

S(t/2) < W(x+y) + W(x)-2w(x+'~) < W(x+y)-W(x)-(VW(x),y)-2b(t/2). 
This clearly implies S(t/2) + 2b(t/2) < b(t). □ 

Definition 5.6. In what follows let 

1) & be the maximal convex function majorized by b 

2) b* be the corresponding convex conjugated function: 

b*{t) = sup{ts-fe(s)}. 

s>0 

The results of the following proposition are known (see [11]). For the reader 
convenience we just sketch the proof of 2). 

Proposition 5.7. Let W be a convex function such that v = e~ w dx is a probability 
measure. Consider a norm || • || on M. d and define b by A25\). Let f -v be a probability 
measure andT = \7ip be the optimal transportation mapping sending v to f -v. Then 
the following holds: 

1) Talagrand's type inequality: 



b(\\Vtp(x) - x\\)du < / flogf dv, 
2) a modified log-Sobolev-type inequality 

f /log/^< / &*( ^ )fdu, 
Jr" Jr* v *' 

where |j • ||* is the corresponding dual norm 

Proof. Let as use the above-tangent arguments. Let T — Vip be the optimal trans- 
portation sending / • v to v. By the change of variables formula 

= W(x) - W(Vip) + log det D 2 a ip - log f{x), 

where D\ip is the second Alexandroff derivative of ip (see [35] for details). Hence 

log / =W(x) - W(Vip) + log det D%tp = 

W{x) - W(Vip) - (VW(x),x- Vip) 

+ (VW(x),x- V(p) + log det D^ip 

< -b(\\Vtp(x)-x\\) + (VW(x),x-V<p) + log det D^- 

Let us integrate this formula with respect to f -v. Integration by parts and convexity 
properties of tp give 



f\ogfdv<- b(\\V<p(x)-x\\)fdv+ (Vf(x),x-V<p)dP. 

JR d JR d 



It is well-known and easy to verify that TrD^ip — d — log det D\<p> > 0. Applying 

\ f[x) I f{x) * 

and the Young inequality one completes the proof. □ 
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Finally, the following concentration inequality 



u(A r ) > 1 - 2e~ 2i ^ 2 \ 

v(A)>-, A r = {y: \\x - y\\ >r} 

can be easily verified by a modification of Marton's arguments. Let Tj = Vyi, 
T2 = V<^2 be a couple of optimal transportation mappling pushing forward v to 
v\ = 7^4) and ^2 = v Aa^ v \a% respectively Then T = T2 o Tj~ sends 1^1 to v-i- 
One has 

b{r/2)<{ b(h\x-T(x)\\)d Vl = [ ft^HTi-TallW 
Hence by convexity of b and the Talagrand's type estimate 

26(r/2) < / &(||T 1 (a;)-2:||W + / &(||T 2 (x) - x||)di/ 




Hence 

aS(r) < 1 

This work was supported by the RFBR project 07-0f-00536, GFEN-06-01-39003, 
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